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I.  A  SUMMARY  OF  AN  ALGORITHM  PROPOSED  BY  A.  J.  FEDEROWICZ  FOR  THE  FIXED 
^COST  LINEAR  PROGRAMMING  PROBLEM 

In  hie  paper,  Federowicz  proposes  an  algorithm  for  solving  the  *fixed 

P 

cost  linear  programming  problem"  where  the  problem  ie  the  same  as  in  the 

linear  programming  problem  except  that  in  the  objective  function  there  is 

is 

a  fixed  cost  associated  with  turning  a  variable  on.  It  vAIl  he  the  purpose 

of  this  paper  to  summarize  this  method  and  another  one  solving  a  closely 

related  problem  end  to  compai’e  the  two.  It  turns  out  that  Federowicz's 

so 

problem  is  a  special  case  of  the  other  problem  and  thun  we  ohell  present, 
later  onT  a  generalization  of  this  problem <oe2>  P  <£> 


The  problem 

The  first  problem  considered  by  Federowicz  was: 


n 


Minimize 


J-l 


c 

j~i  J 


Subject  to 


J-l 


Vj  2  bi 


i  *  1, . .  m 


6j  -  0  if  Xj  *  0  J 


1 , . . .  n 


if  x,  >  0 

J 


Xj  —  o 


J  «  1, . . .n 
J  m  1, . . .n 


This  problem  has  applications  with  the  problem  of  scheduling  which  of 
a  certain  number  of  faculties  ehould  be  turned  on  and  how  much,  if  a  fixed 
charge  is  to  be  paid  for  turning  them  on,  and  other  applications  (see  [1] 

p-  10). 

B.  The  Method  -  A  General  Description 

First  consider  the  domain  where  the  variables  are  defined,  i.e.. 


Xj  >  0  J*l,2,o..n.  We  shall  call  a  subset  of  the  domain  a  subset  that  has 


the  property  that  Xj-0  or  Xj>0  j«l,2,...n.  Thus  {xjx^  -  OjXj  -  0,  x^,...xnX)} 
io  a  subset.  Now  consider  the  function  C(x)  ■  £cjX .  c jxj  defined  on 
each  of  the  subsets.  It  is  linear  and  convex  there  and  hence  we  can  use  the 
simplex  method  on  each  of  the  2n  subsets  and  take  the  minimum  of  all  of  these. 
This,  however,  is  too  lengthy  and  we  must  find  a  method  that  will  solve  the 
problem  without  necessarily  looking  at  all  of  the  subsets .  To  this  end,  con¬ 
sider  the  union  of  a  certain  number  of  the  subsets  such  that  the  union  io 
convex.  We  will  call  this  a  group.  New  consider  just  the  variables 
such  that  x,  >  0  or  ^  >  0,  l.e.,  the  ones  that  are  definitely  turned  on 
end  the  ones  that  may  or  may  not  be  turned  on.  Now  consider  C(x)  and  the 
constraints  where  all  the  other  Xj  **  0,  and  consider  this  problem  eolved  by 
the  simplex  method,  disregarding  tne  effect  of  the  fixed  costs.  Now  if  wo  add 
Just  the  fixed  costs  of  thos*  definitely  turned  on  we  obtain  a  lower  bound 
on  the  final  minimum,  or.  this  group,  because  the  only  way  we  can  get  a  lower 
coot  is  to  turn  off  one  of  the  variables  Xj  >  0  and  not  incur  its  fixed 
coet^  tut  we  never  add  it,  thus  Jt  is  a  lower  bound.  Or,  we  can  think  of 
the  function,  C(x)  -  a, *  f(x)  where  f(x)  is  just  the  sum  of  the 
fixed  charges  definitely  on,  as  a  lower  bounding  function  on  C(x)  in  the 
group.  Thus  the  algorithm  ie  as  follows:  First  we  consider  the  whole  spe.ee 
Xj  >  0  as  a  group,  then  obtain  a  lower  bound  and  use  tha  actual  value  ob¬ 
tained  as  an  upper  bound  on  the  final  minimum  We  then  remove  the  subset 
found  by  the  simplex  algorithm  where  the  minimum  occurs  of  the  bounding  function 
and  partition  the  rest  of  the  group.  We  find  a  lower  bound  and  a  feasible 
solution  on  each  of  these  and  take  the  one  that  has  the  lowest  lower  bound 
We  then  remove  the  subset  ao  before  and  continue  on,  each  time  parti  -ioning 
the  group  that  has  the  lowest  lower  bound  until  we  finally  have  a  lower  bound 
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on  the  remaining  groups  (the  whole  sot  minus  the  removed  subsets)  that 
exceeds  the  value  of  one  of  the  removed  subsets.  Then  this  removed  subset 
with  the  lowest  minimum  is  the  solution  since  all  the  removed  subsets  have 
a  greater  minimum  and  we  have  a  lcwer  bound  that  exceeds  this  minimum  for 
the  croups  that  are  left. 


C.  The  Algorithm  Used 
n 


n 


Let  C[X] 


c,x,  ♦ 


j“l  J  J  j-1  i  J 


and  X  is  some  convex  of  the 


2n  subspaces,  and  let  C(X)«  Z.  c  x  ■*  f(x)  where  f(x)  is  the  sum  of 

j“l  J  « 

the  fixed  costs  of  the  variables  definitely  turned  on.  Let  S  be  a  rule 

for  determining  how  to  subtract  one  of  the  2n  subsets  from  some  group  of 

them  and  dividing  the  remainder  into  groups. 

Let  X~be  the  set  of  all  the  2n  subsets  and  X  a  particular  group 
“*  ■'  n 

of  the  2n  subsets. 

The  algorithm  is  as  follows: 

1.  Minimize  C(Xyj_)  by  the  simplex  method,  let  the  eubset  on  which 

i 

this  occurs  be  X  , .  . 


2.  Set  p  ■  1,  X  «  X  ,  r  ■  0. 

% 

3.  Apply  S  to  X  -  X  . 

T>  P 

4.  Let  X  -  X  ♦  x"  .  ♦  x"  . .  ,x" 

cjp  p  r+1  r+2  r+np 

5  Set  I  «■  1,2, . .  .n^ 

6.  Minimize  C[X^j3  by  the  Simplex  Method,  let  the  subset  on  which 


this  occurs  be  X 


r*T 


7*  Set  r  »  r  ♦  n  ,  p 
P 


p  ♦  1 
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8.  Figidj  min  n  C  [x^]} 


CT  min 


y. 

10. 

n. 


Find  min  {min  C[x^3)  -  C^,  min  where  k  excludes  previous  Cp  min. 

If  c  min  >  C_,  min  then  we  are  done  and  C_  min  is  the  answer. 
~  T  T 


Otherwise,  1st  the  k  from  which  C  ,  min  occurred  be  denoted  by 

Hat 

k  .  Let  X 
P 


*k  *  *k  - 


12.  Go  to  step  3. 

It  can  easily  be  eeen  t>iat  the  algorithm  will  terminate  because  there 
are  oily  a  finite  number  of  subsets  to  subtract,  and  eventually  they  will  all 
be  looked  at  if  a  solution  is  not  found  before.  This  is  a  specialization  of 
a  more  general  algorithm  given  by  Federowiez,  pp.  65-67  in  Cl 3  -  The  follow¬ 
ing  example  is  the  one  worked  out  by  Federowiez  in  [1] 


D  Example 

An  example  will  serve  to  illustrate  and  clarify  whet  has  been  said.  The 
problem  is  to  minimize  C(x)  -  lOOOx^  *  1000x2  *  lOCOx^  ♦  3006^  *  700 ♦  4006^ 

where  6j  **  0  if  x^  “  0 
-  1  if  x,  >  0 

O 

subject  to  r  >0  J  *  1,  2,  3 
^  ■*  2/3  »•  >  1 

1/2  Xj  ♦  Xg  ♦  2/3  Xj  >  1 

We  may  now  diviae  the  space  into  8  subsets  where  the  cost  function  and 
the  constraint*  are  linear.  Thus  one  can  oolve  each  of  uheae  problems  in 
each  of  the  2^  or  more  generally  2n  subsets.  They  are: 
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M 

• 

■ 

1 

1 

o 

4.  x^  x^O,  x^  -  0  7.  x^. 

x^O,  x1  -  0 

2.  x^70,  x2  -  Xj  »  0 

3.  xj*>,  “  *3  “  0 

Enumeration  of  all 

5-  x^70,  ^  -  X2  “  0  8.  x^, 

6.  x^  x370,  1/2  -  0 

of  these  subsets  is  as  follows: 

*2'  *370 

Subset 

Minimum  Point 

Minimum  of  C(x) 

1 

No  solution 

- : — 

2 

(2,0,0) 

2300 

3 

(0, 3/2,0) 

2250 

4 

(1/2, 3/4,0) 

2250 

5 

(C,  0,3/2) 

1900 

6 

(0,0, 3/2) 

2200 

7 

(0,3/5, 3/5) 

2300 

8 

(0,3/5, 3/5) 

2600 

It  will  be  noticed 

that  on  subsets  6  and  8  x^  -  0 

whereas  this  is 

outaide  the  region  of  definition.  This  is  remedied  by  letting  x^  be  arbi 

tr&rily  small  but  positive.  Thun  we  see  that  we  may  come  as  close  to  the 
specified  cost  an  xe  wish.  In  other  words,  this  cost  represents  the  greatest, 
lower  bound  of  all  the  costs  in  that  region  It  will  also  be  noticed  that 
any  ouch  subsets  cannot  have  the  solution  because  by  turning  off  the  variable 
that  becomes  arbitrarily  sisal  1  we  do  not  incurs  its  fixed  cost,  and  the  cost 
is  thus  smaller  in  some  other  region 

From  the  table  one  can  tee  that  the  minimum  is  1900  and  it  occurs  at 
(0,0, 3/2).  To  find  thiea  however,  wa  had  to  look  at  all  of  the  2n  •  2^  •*  8 
subsets,  which  la  mush  too  laborious .  What  we  would  like  is  a  method  which 
looks  at  far  fever  subsets  ano  can  tell  it  ie  done  without  necessarily  look¬ 
ing  at  them  all. 
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tha 

Let  us  first  denote  a  group  oj/o'  subsets  by  and  Bq  vdll  represent 
(xjx.^,  >  0,  Xg  >  O,  Xj  >  0}.  C0  (x)  in  general  will  be  -  e^x^  ♦  f(x) 

where  x  Bn  and  f(x)  is  the  sum  of  only  those  fixed  costs  definitely  turned 
on.  Thus,  Cg  (x)  *  10C0(x^  ♦  ♦  x^).  Also  it  is  noted  that  C(x)bCg  (x) 


for  x  B  which  is  the  union  of  all  of  the  8  subsets.  Therefore  the  mini- 
o 

mum  of  Cg  may  serve  as  a  lower  bound  on  the  minimum  of  C(x).  The  mini- 
o 

mum  of  Cg  may  be  found  by  the  simplex  method,  and  we  find  it  1b  at 
o 

(0,  3/5,  3/5)  and  C0  (xniQ  '  "  1200  and  C(xM<  )  -  2300.  V/e  now  remove 

o  n  \ 

set  7  from  consideration  and  divide  the  remaining  subset  into  three  groups. 


defined  as  follows 

Set 
$cjs  , 


Set 


1,2, 5,6  B1"{x  x1>0)x2>0,x.3>0} 

3*4  B2-(x  x^>p,x.£>0,x^> 0) 

S  B^-{x  Xj  X) ,  x^X) ,  x^>0 } 


Lower  Bounding  Function 
C0  (x)-1000(xj  ♦  x^) 

Cg  (xJ-IOOOCxj  ♦  x^)  ♦  7C0 

Cg  (x)-10C0(x1  *  X2  ♦  Xj)  *  1400 


The  rule  S  for  choosing  the  Bn  will  bo  discussed  later.  It  can  be 
seen  that  it  has  a  great  deal  to  do  with  the  success  of  the  algorithm, 
now  using  the  simplex  algorithm  on  these  three  functions  we  find; 


Function 
Cr 


B, 


E>„ 


Minimum 

Kin  Ca  (x) 
n 

C(x)  where  Cfi  (x. 

n 

Point 

(o,o, 3/2) 

1500 

1900 

(1/2, 3/4,0) 

1950 

2250 

(0,3/5, 3/5) 

2600 

2600 

Now  we  see  since  the  minimum  of  ,  C_  ,  C„  ie  1500  we  have  now 

D1  B2  B3 

a  new  lower  bound  on  the  minimum,  C(x )  on  the  subsets  1,2, ,..6, 8.  Also 
we  have  a  new  value  for  C(x)  of  1900,  which  was  better  than  before.  Now 
since  this  minimum  occurred  on  subset  5  we  remove  it  from  and  look  at 

the  rest,  i.e,,  1,2,  and  6.  V/e  divide  them  as  follows: 


Set 

Nos.  bet 

1,2  *■  {xjxL>p,x2  -  0,x^  0} 

6  B$  -  {%Jx3>0,x2  -  0,x3>0} 


Lower  Bounding  Function 


Cg  (x)  -  lOOOx^ 

Cg  (x)  -  1000(x1  *■  x3)  ♦  700 


And  solving 

each  of  these 

we  find: 

Bounding 

Kin. 

Min.  of  C,, 

Bi 

Corresponding 

Function 

Pt. 

Min  C(x) 

Vx) 

(2,0,0) 

2000 

2300 

V*’ 

(0, 0,3/2) 

2200 

2200 

Now  we  find  that  the  minimum  of  C-  ,  CD  ,  C  ,  C  is  1950,  and  therefore 

B2  B3  BU  B5 


this  may  serve  an  a  lower  bound  on  the  minimum,  of  C(x)  on  1,2, 3, 4, 6, 8. 


The  best  C(x)  found,  however,  was  1900  on  subset  5.  Therefore,  we  may 
conclude  that  (0, 0,3/2),  the  minimum  point  found  on  5,  is  the  minimum  of 
C(x)  since  C(x)>1950  on  1,2, 3,4, 6, 8  and  C(x)>2300  on  subset  7. 

It  should  be  observed  that  on  each  of  the  2r  subsets  C(x)  Is  linear 
and  convex  and  thus  the  problem  may  be  solved  completely  on  any  one  of  them. 
However,  on  a  group  of  subsets  such  as  the  one  can  only  bound  the  C(x) 

with  a  linear  convex  function,  which  allows  us  to  use  the  simplex  algorithm. 


~Q. 


Also  we  note  that  the  sequence  1200,1500, 1V5C  may  be  identified  with 
Cp  min  and  the  sequence  2300,1900,1900  as  tain  in  section  C.  Also, 
the  Cg  (x)  with  C[X£]  and  -O-  "  Bq. 


E.  Reduction  of  Computations 

In  the  preceding  example  it  will  be  noted  that  six  times  a  function 
had  to  be  minimized  by  the  simplex  method.  This  does  not  seen  like  much 
of  an  improvement  over  the  original  eight  times.  However,  we  need  not 
start  from  scratch  each  time.  When  a  minimum  is  found,  to  go  to  a  different 
region  we  need  only  eliminate  the  appropriate  basic  element  by  pivoting  on 
its  row  and  then  removing  its  column.  An  example  will  illustrate  and 
clarify.  The  original  tableau  for  the  problem  is: 


Faculty 

1 

2 

3 

4 

5 

Fixed  Costs 

300 

700 

400 

0 

0 

Unit  Costs 

1000 

1000 

10C0 

0 

0 

Requirements 

Matrix 

1 

l 

2/3 

1 

-1 

0 

1 

1/2 

1 

2/3 

0 

-1 

solving  this  we  find: 


Faculty 

1 

4 

5 

Fixed  Costs 

300 

0 

00 

Unit  Costs 

1000 

0 

0 

Faculty 

Fixed-Cost 

Unit  Coats  Amount 

Matrix 

3 

400 

1000  3/5 

6/5 

-9/5 

6/5 

2 

700 

1000  3/5 

-3/10 

6/5 

-9/5 

Marginal  Costs 

-100 

-600 

600 

I 
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If  we  wish  now  to  find  min  [CD  (x)]  we  nnist  eliminate  faculty  2 

1 

frora  the  basis  and  minimize  the  result.  Only  4  can  be  used  to  eliminate 
2,  thus  we  must  pivot  on  6/5  and  forget  about  that  column.  The  result  is: 


Faculty 
Fixed  Costs 
Unit  CostB 

Faculty  Fixed  Costs  Unit  Costs  Amount 

3  400  1000  3/2 

4  0  0  1/2 


Marginal  Costs 


1  5 

300  0 

1000  0 

Matrix 

3/4  -3/2 

-1/4  -3/2 

-250  -1500 


F.  The  Set  Subtractions  Problem 

Previously  it  was  stated  that  there  was  a  rule  (s)  for  determining 
how  to  break  up  the  groups  of  subsets  into  smaller  groups  and  just  one  of 
the  subsets „  The  rule  deecribed  by  Federowicz  will  now  be  presented 
(:3c  pp.  68-71  in  [1]).  First,  hcwever,  we  will  need  some  new  rotation. 

We  will  denote  a  group  of  subsets  by  a  series  of  B^'e  and  B^'s  "multiplied” 
together.  The  presence  of  Bi  will  indicate  ie  definitely  on.  The 
presence  of  B^  will  indicate  x±  *  0.  If  neither  is  printed  it  will  mean 
XjX)-  Thus,  B2  Bj  is  the  subset  4,  x1,x2>0,x^  -  0.  B1  is  the 
union  of  subeets  4  and  2,  i.e,,  x1>0,  x£C,  x^  -  0,  etc.  Thus  we  can  see 
that  any  two  different  representations  by  this  notation  represents  two 
different  convex  groups  of  subsets,  and  conversely. 


low  it  can  ds  observed  with  this  notation  that  there  are  a  number  of 


ways  of  dividing  any  group.  For  instance 
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4 

B2 

r> 

“l 

♦ 

B3 

B2 

B1 

+ 

B. 

B, 

♦ 

B_  B.  B. 

3 

1 

2 

3 

1 

4- 

B, 

B_ 

B_ 

B, 

B„ 

1 

2 

3 

1 

2 

4- 

B„ 

B„ 

♦ 

B, 

B_ 

B_ 

3 

2 

1 

2 

3 

♦ 

B_ 

B_ 

4- 

B, 

B_ 

Bo 

2 

3 

1 

2 

3 

♦ 

B, 

B_ 

4 

B„ 

B. 

Bo 

1 

3 

2 

1 

3 

°1  2  3  °1 

-  b2 

'  B2 
"  B3 


and  it  vd.ll  be  seen  that  each  division  corresponds  to  a  particular  ordering 
of  the  variables.  For  consider 


jTL  -  B  B  ...  B 
"l  n2 


B 


BB  ♦  B  B  B  ...  ♦  B  B  . . -B 


nl  n2 


nl  n2  °3 


nl  "2 


m 


where  B  •  B.  or  B.  for  boos  j  and  B  «  B.  ■=  B.  if  B  ** 
n,  j  j  J  ni  j  J  n± 

and  similarly  for  the  other  set  subtractions. 

The  rule  for  determining  the  ordering  of  B  is  as  follows* 
le  sst 


’ 


Let  /S  B’t  be^ ■ 

J-l  J 


subtraction  undsr  considerations  and  C  and  C  be 


the  same  as  before  and  f  r_  the  fixed  coats.  Then  we  divide  the  n  variables 
into  three  classes. 

Class  1:  Set  of  variables  for  which  B*  **  B^  and  C(X  )  ♦  f  j  >  C(^) 

Claes  2%  Set  of  variables  for  which  B*  -  Bj 

Class  3:  Set  of  variables  for  which  B*  -  and  C(X  )  +  fj  <  C(^) 

Class  1  consists  of  those  variables  which  were  off  in  the  linear  program¬ 
ming  solution  and  whose  fixed  cost  plus  the  lower  bound  exceeds  the  actual 
cost  obtained.  Since  by  turning  the  variable  x1  on  we  incur  at  least  fj 


we  know  that  it  must  not  be  turned  on.  Therefore,  Class  1  consists  of 
those  variables  that  were  turned  off  and  should  remain  off. 


Claes  2  consists  of  those  variables  which  were  turned  or.,  and  they 
are  placed  in  the  inverse  order  of  our  estimate  of  the  minimum  cost 
incurred  in  turning  each  one  off. 

Class  3  variables  are  last,  end  it  is  difficult  to  see  why  any  order¬ 
ing  is  better  than  ary  other  since  by  turning  the  variable  on  we  have  a 
lower  bound  lower  than  actually  obtained. 

G  Incomplete  Solutions 

It  can  easily  be  seen  that  if  the  algorithm  is  stopped  before  it  is 
finished,  the  current  minimum  on  the  removed  subsets  and  the  minimum  of 
the  lower  bounds  on  the  other  subset*  serve  as  upper  and  lower  bounds 
respectively  on  the  solution. 

In  the  computation  of  most  of  the  problems  worked  out  by  the  computer 
Federowicz  found  that  the  solution  was  found  relatively  quickly  and  what 
took  most  of  the  time  vaa  obtaining  a  lower  bound  greater  than  the  current 
minimum  found,  i.e.,  verifying  that  the  minimum  hae  been  found.  For  ex¬ 
ample  (p.  74  in  [1])  one  problem  took  31  revisions  to  find  the  minimum 
and  85  further  revisions  to  verify  it.  Another  problem  took  4  revisions 
to  find  the  minimum  and  41  to  verify  it.  Thus  one  is  tempted,  for  larger 
pi-oblems,  to  quit  after  the  same  minimum  has  been  found  a  certain  pre¬ 
determined  number  of  times,  and  use  that  minimum  an  a  solution. 


II.  GENERALIZATION  OF  FEDEROWICZ'S  METHOD  TO  SCWARC'S  PROBLEM 


Federowics  considers  a  generalisation  of  his  algorithm  (see  p.  65 
in  Cl])  and  to  solve  Srwarc 's  problem  ve  must  apply  the  generalisation 
rather  than  the  actual  algorithm  previously  considered.  Szwarc  s 
problem  is  as  follows: 

n 

Maximize  z  Z  c ,  x 

j-1  J  J 
n 

subject  to  £  a,  *  £  b.  and  x,  »  0  or  1  for  J  •  1,  2 

j»l  i  J 

i  «*  1,  2„.„m,0<Xj<dj  for  j  -  p  *  1,  , , 

Now  one  can  see  that  the  algorithm  will  be  somewhat  different  frcm 

before  and  is  as  follows:  First  we  consider  the  2?  subsets  as  before  and 

the  problem  defined  on  each  of  them  It  is  easy  to  see  that  each  one 

could  be  solved  by  the  simplex  method  and  we  could  then  take  the  best 

solution  However  this  is  too  lengthy.  As  before  we  instead  consider 

a  group  of  these  subsets  whera  come  are  definitely  O,  some  definitely  1, 

and  others  either  0  or  1,  and  we  shall  obtain  an  upper  estimate  on  the 

final  maximum.  Now  suppose  we  have  such  a  group  of  subsets  If  x^(j  <  p) 

is  definitely  0  or  definitely  i  it  will  remain  so  in  the  objective 

function  and  the  constraints.  If  x^  >  0(j  <  p)  x^  ••rill  be  considered 

to  be  1  in  s(x),  the  objective  function,  if  >  0  and  x^  will  be 

considered  to  be  0  if  c^  <  (|„  The  lower  estimate,  similar  to  before, 

shall  be  the  value  of  the  basis  that,  is  feasible  and  satisfies  the 


Boolean  constraints.  For  the  upper  estimate  we  now  need  only  worry 
about  the  constraints.  For  each  constraint  inequality  we  will  derive 
a  new  one  which  will  amount  only  to  changing  the  b  if  we  only  wo  rk  with 


*  a  o 


X  /  —  *  a  o  x  In  the  simplex  solution.  That  la  B.  -  b.  -  Z  a*  where 
P  r  1  a  11 

if  Xj  is  1  define  tely,  or  if  a^j  <  0  and  x^  Is  in  douht 
^*70  if  x j  is  0  definitely,  or  if  a^  >  0  and  x.  la  in  doubt 

Thus  with  Ba  instead  of  b^  and  considering  only  xp  +  ,  ,  .  xq 

we  can  now  solve  the  linear  programming  problem  using  the  simplex  method, 
and  this  serves  as  an  upper  bound  since  any  value  of  the  Xj’s  in  doubt 
that  satisfy  the  original  constraints,  satisfy  the  new  constraints. 


A,  Reduction  of  Computation 

As  before  we  need  not  make  a  new  tableau  and  3olve  each  one  from 
scratch  in  the  simplex  algorithm,  If  wo  record  each  <  p)  a«d 

evaluate  it  after  the  first  minimization,  Le„  of  the  lower  bound.  We 
may  then  pivot  to  get  rid  of  those  values  of  x^  in  the  basis  which  become 
negative  when  the  appropriate  change,  and  maximize  the  resulting 
tableau o  Also  in  changing  from  group  to  group  we  need  only  pivot  to 
get  rid  of  tha  negative  B^,  and  then  maximize  the  resulting  tableau, 

B,  Set  Subtraction  Problem 

Now  we  must  detenajjie  which  subset  to  extract  from  the  group  that 
is  to  be  divided  and  how  to  break  It  up  Before,  the  simplex  solution 
gave  the  subset  to  be  extracted  but  in  our  case  this  need  not  be  the 
case,  and  in  fact  would  not  be  desirable*  since  we  know  beforehand  on 
which  subset  our  feasible  maximum  comes  frem  (vhe^e  all  the  x^  In  doubt 
-  I),  Thus  the  only  thing  that  would  distinguish  our  algorithm  from 
enumeration  (if  we  were  to  always  to  extract  that  subset)  would  be  the 
ordering  used  to  determine  how  to  break  up  the  group  and  which  group 
has  the  greatest  upper  bound.  Thus  to  gain  greater  freedom  ecme  better 
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wny  of  determining  which  set  to  extract  would  be  in  order.  One  method 
might  be  to  turn  ofTthoee  v&riablee  In  the  matrix  that  cause  the  total 
increase  in  the  of  the  matrix  minus  its  fixed  charge  in  s  if  it  is 
negative  and  plus  if  it  is  positive  to  be  the  greatest.  Those  with  a 
positive  increase  would  be  turned  off  and  those  with  &  negative  increase 
kept  on. 

As  for  subtracting  the  set  we  have  now  just  to  investigate  which 
ordering  should  be  used  to  determine  how  to  break  up  the  group  of  sub¬ 
sets  in  question*  In  contrast  to  the  example  we  should  use  more  information 
than  just  the  fixed  cost  in  the  objective  function.  We  may-  put  those 
variables  first  which  are  on  and  which  their  fixed  cost  in  the  objective 
function  subtracted  from  the  upper  bound  minus  come  estimate  of  the  de¬ 
crease  In  the  objective  function  due  to  the  other  variables  is  smallest. 

Thus  using  this  total  estimate  we  may  order  all  the  variables  in  this 
problem  and  determine  the  resulting  groups  of  subsets  as  before. 

C „  Conclusion 

One  will  notice  that  this  generalisation  is  somewhat  different  from 
the  original  problem,  because  of  the  added  Boolean  constraints  and  because 
in  the  solution  in  finding  the  estimates,  just  the  domain  of  definition 
varies,  not  the  objective  function.  And  when  extracting  a  subset  from 
a  group  we  still  must  find  the  maximum  on  the  subset  extracted.  It  is 
not  automatically  provided. 

Do  Original  Problem  As  A  Special  Cass 

It  should  be  noted  that  the  first  problem  may  be  stated  as  a  special 
case  of  Sswarc’s  problem  (with  min  replaced  by  max)  as  follows: 
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Maximize 


where 


n 

n 

^  !>i\ 
j-0  J  J 

♦  Z 

J-o 

“  dJ  *J  ' 

"ji  0 

n 

^  ,ai J 
j-1 

>  0  and  1  i,  »  O  or  1  ard  the  d. 


2  «  o  j  n 


arc  big  enough 


IS' 


III.  IKE  GENERALIZATION  CF  FEDEROWICZ'S  METHOD  APPLIED  TO  At.  EXAMPLE 
SOLVED  BY  FEDEROWICZ'S  ORIGINAL  METHOD. 

The  problem  ie  to  minimize  ■  «  300x^  ♦  70Ox^  *  UOOx^  ♦  1000 (x^+x^+x^ 
subject  to 

x4  *  2/3  *5  +  x6  -  1  2*1  ~  x4  £  0 

1/2  x'h  ♦  x5  *  2/3  x6  >  1  3/2  Xg  -  x$  >  0 

3/2  ^  -  x6  >  0 

The  dual  tableau  will  look  like: 


yl 

y2 

y3 

y4 

y5 

*1 

300 

0 

0 

2 

0 

0 

*2 

700 

0 

0 

0 

3/2 

0 

*3 

400 

0 

0 

0 

0 

3/2 

*4 

1000 

1 

*l/2~ 

-1 

0 

0 

x5 

1000 

2/3 

1 

0 

-1 

0 

x6 

1000 

1 

2/3 

0 

0 

-1 

x6 

1000 

-1 

-1 

2 

3/2 

3/2 
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And  after  pivoting 


X6 

X5 

y4 

y5 

x1  300 

0 

0 

2 

0 

0 

Xj  200 

0 

0 

0 

3/2 

0 

x,  400 

0 

0 

0 

C 

3/2 

x.  100 
4 

-6/5 

3/10 

-1 

-3/10 

6/5 

y2  600 

-6/5 

9/ 

0 

•9/5 

6/5 

yL  600 

9/5 

-6/5 

1  j 

6/5 

-9/5 

1200 

3/5 

3/5 

2 

9/10 

-9A0 

Thus  we 

now  see  that  we 

have  a 

lower 

bound  of  1200  on  j  L  Just  as  In 

the  origi  nal  problem- 

Mow,  because 

of  the 

nature  of  the  matrix  we  may 

subtract. 

b2  b3 

from 

n 

Vre  do 

this  because  we  eee  that  would 

be  0,  if  x^  were  0,  and  this  is  not  so  with  any  of  the  others.  Thus,  on 

2300  is  the  minimum  and  this  would  bo  the  first  element  of  the 
decreasing  sequence,  how  suppose  we  wish  to  find  the  ml r.lmum  on  B 2«  To 
do  this,  we  first  change  the  last  row  only  in  y^  column.  By  subtracting 
3/2  we  sea  that  the  9/10  changes  to  -3/5,  thus  we  pivot  on  this  column  to 
obtain  a  new  lower  bound  of  1500  as  before,  and  then  we  continue  on  to  the 
other  group®  Thus  one  can  oaa  that  with  ths  correct  set  subtraction  we 
can  Obtain  tho  same  algorithms  as  before,  only  the  matrix  is  different  and 
the  rules  Tor  pivoting  from  group  to  group  are  different.  However,  the  re¬ 
sulting  two  sequences  are  identical,  if  the  set  subtractions  are  the  same. 
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IVo  SZWARC fS  METHOD 

In  his  paper  [2]  Szwarc  presents  a  completely  different  method  from 
Federuwlcs’s  in  solving  the  mixed  integer  linear  programming  problem , 
Federowica  •  s  method,  roughly  speaking,  is  to  start  with  a  feasible 
maximum  and  continually  get  better  ones  until  the  estimates  indicate 
that  the  right  one  is  found*  Thus  he  works  his  way  up  through  feasible 
maxlmi*  Szwarc  * s  method  is  just  the  opposite*  He  works  his  way  down 
through  infeasible  aunrimmi  until  a  feasible  solution  (one  satisfying 
the  Boolean  constraints)  is  found  and  thus  the  computation  is  ended* 


A*  The  Problen 


A  slightly  more  general  formulation  of  the  problem  than  previously 
stated  is: 


Maximize  £  c,  x,  ■*  x 
J-l  J  J 
n 

subject  to  £  a.  x,  -  b. 

j*l  J  1 


x  -  1,  2 


m 


(1) 


0  <  Xj  <  1,  J  -  1„  2, 

J  *  p  ♦  1,  ...  *n* 


.  p  <  n,  0  <  Xj  <  dj  (dj  >  0), 

J  **  1,  *  *  a  p» 


(2) 

(3) 


We  shall  call  this  problem  I*  This  is  the  problem  we  eventually  wish  to 
solve. 

The  first  part  of  condition  (2)  may  seem  unneeded  considering  condition 
(3),  but  we  shall  form  another  problem,  called  problem  II,  where  we  only 
have  conditions  (1)  and  (2)*  That  is  for  problem  II  we  maximize  the  ease 
function  as  problem  I  but  with  just  the  conditions  (1)  and  (2)0 
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If  there  are  several  feasible  solutions  for  xg(t)  we  must  consider 


them  all „ 


We  also  know  that  xg(t)  is  a  linear  function  of  t  in  the  Interval 
Cts«l'  ts1' 

N'cs*  it  can  be  proved  (see  Scwarcz  Appendix  in  [2]), 

1^  The  minimum  value  of  t  such  that  xg  (t )  satisfies  the  Boolean 
constraints  la  the  solution  to  problem  1-  This  is  easily  seen 
by  looking  at  consitlon  (1)‘» 

2  The  solution  to  problem  I  occurrs  at  a  comer  point  of  problem 

III-,  That  is  either  x  (t  .  )  or  t  (t  )  satisfies  the  Boolean 

S  8*1  8  8 

constraints  if  some  t.  in  [tg  ^ ,  tg]  allows  xg(t )  to  satisfy  the 

Boolean  constraints,  and  if  xg(t  )  satisfies  the  Boolean  constraints 

t  JLs  the  unique  value  of  t  or  x  ft  ,  )  also  satiefiee  the  Boolean 
£  s  s— JL 

constraints , 

B  The  Algorithm 

life  car  now  see  mo  *e  or  less  how  to  solve  the  problem  with  the  help 
of  these  last  two  statements  The  following,  however,  is  an  algorithm 
proposed  by  F,  M.  Tonga  in  his  paper  L3)* 

1  Find  the  solution  to  problem  II,  (in  general  there  will  be  only 
1.  and  it  will  not  satisfy  the  Boolean  constraints  If  it  does, 
however,  we  are  finished)  and  consider  problem  III, 

2  Establish  a  list  L  which  is  to  be  a  list  of  extrema  points,  in  a 
set  of  basis  vectors  x  ^  ( t ) ,  and  always  keep  it  increasing  order 
of  t_  max  , 

3>  Find  each  optimal  basic  solution  to  problem,  and  find  all  first 
intervals,  in  the  x^(t)  sequences,  and  find  the  t  max  for  that 
interval  and  enter  it  in  the  appropriate  spot  in  the  list  L, 


Also  check  to  nee  If  this  entry  in  the  list  satisfies  the  Boolean 
constraints  and  if  it  does  delete  all  entries  in  the  list  that 
have  a  greater  t  max,. 

That  is,  if  i’  is  the  row  of  the  constraint  becoming  negative, 
enter  in  the  list  L  the  basis  formed  by  replacing  i*  by  with 
t  max  given  by 


where  Is  ths  coefficient  of  t  in  equation  i,  and  ^ ,  <  0<. 

4°  Consider  the  first  entry  in  the  list  L„  If  it  has  satisfied 
the  Boolean  constraints  (it  will  have  been  checked  before)  we 
are  done.  If  not  move  to  step  5» 

5,  Remove  the  first  entry  from  the  list,  then  for  that  entry  find 
all  the  possible  next  intervals  and  enter  their  basis  in  the  list 
as  in  step  3,  checking  for  Booleaness.  Then  return  to  step  k 
If  no  further  intervals  can  be  found,  no  solution  to  problem  I  exists. 
It  should  be  noted  that  due  to  the  nature  of  constraint  (  1*  )  there 
will  in  general  be  a  number  of  first,  second,  etc,  intervals.  In  fact, 
this  is  the  main  disadvantage  of  the  algorithm  , 

It  was  pointed  out  by  Tonga  that  in  the  list  L  only  changes  in  the 
bases  need  be  considered  in  each  entry  s  (see  page  3  in  (33)-' 
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V.  SZh'ARC  'S  MSI' HOD  APPLIED  TO  AJ.  EXAMPLE  SOLVED  BY  FEDEROWICZ'S  METHOD 

Szwarc's  algorithm  ia  primarily  concerned  with  a  maximization  problem. 
However,  it  is  easily  seen  how  to  apply  it  to  the  minimization  problem, 
in  particular  the  example  considered  by  wederowicz .  The  rest  o f  this  paper, 
however,  will  be  concerned  with  the  maximization  problem. 

The  problem  again  is  to: 

Minimize  -  1000  (x1  <  x2  ♦  x^)  *  300  +  700  f>2  +  1400  b 

where  ®  if  Xj  -  0 

-  1  if  Xj  >  0 

subject  to  Xj  >  0 

^  *  2/3  >  1 

1/2  ♦  atg  ♦  2/3  x3  >  1 

Or  in  Sssvarc's  form: 

Minimize  C(x)  "  3000^  +  ^OOXg  ♦  AOOx^  +  1000  (x^  *  y^  +  x^.) 


subject  to 

w 

Cj. 

IV 

o 

-  x^  >  -  1 

2Xi  -  x4  >  0 

xu  + 

x5  +  x6  —  1 

-  X2  >  -  1 

3/2  Xg  -  x5  >  0 

±,'2  x^  ♦ 

x5  +  2/3  *6 

-  Xj  >  -  1 

3/2  y3  -  x6  >  0 

(1) 

(2) 

(3) 

where  x^,  x^>  x^  arc  0  or  1.  This  is  problem  I.  Problem  II  is  the 
same,  only  without  the  last  constraint.  Problem  III  is  the  same  as  problem 
II,  except  we  have  the  paremetric  constraint  300x^  ♦  VOOx^  ♦  UOOx^  *  1000 
(x^  ♦  Xj  ♦  x^)  >  zq  ♦  t  where  zq  ie  the  minimum  found  by  problem  II. 
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The  constraints  (1)  are  those  that  insure  that  the  Boolean  variables 
do  not  exceed  1.  It  is  hoped  that  they  will  turn  out  to  be  1  or  0  in 
problem  III. 

The  constraints  (2)  are  those  that  insure  that  the  Boolean  variables 
will  be  turned  on  with  their  correct  counterparts,  x^,  x^,  x^  correspond 
to  gx,  g2,  and  Xy  x^,  x ^  to  y2>  x^  in  the  previous  problem. 

The  coefficients  of  x^,  x^,  x^  in  (2)  are  obtainel  from  the  constraints 
in  the  previous  problem. 

The  constraints  (3)  are  identical  to  before.  The  initial  tableau  for 
problem  II  will  look  like: 


Facility 

1 

2 

3 

4 

5 

6 

Res  trsinti 

f- 

J 

-1 

0 

Matrix 

0  0 

0 

0 

{1)\ 

-  1 

0 

-1 

0 

0 

0 

0 

\ 

1 

I 

0 

0 

-1 

0 

0 

0 

2 

0 

0 

-1 

0 

0 

(2) 

i° 

0 

3/2 

0 

0 

0 

V  0 

0 

0 

3/2 

0 

0 

-1 

f  1 

0 

0 

0 

!. 

2/3 

1 

(3) 

J 

ui 

* 

0 

0 

0 

lA: 

1 

2/3 

costs 

+300 

♦  700 

+400 

+1000 

+10C0 

+1000 

Rewriting  this  in  the  dual  form  and  maximizing,  we  have: 
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*2  520 

-6/5 

9/5 

6/5 

-4/5 

0 

-6/5 

4/5 

0 

x.  3320 

X  3 

9/5 

-6/5 

-4/5 

22/15 

0 

4/5  - 

22A5 

0 

y.  2220 
3. 

3/8 

-6/5 

-4/5 

7/15 

-1 

4/5 

-7/5-5 

0 

x,  800 

3  3 

0 

0 

2/3 

0 

0 

-2/3 

0 

0 

x.  1400- 
4  3 

0 

0 

0 

2/3 

0 

0 

-2/3 

0 

yx  300 

0 

0 

0 

0 

2 

0 

0 

-1 

1640 

3/5 

3/5 

2/5 

2/5 

0 

3/5 

3/5 

1 

Thus 

the  eolution  to  problem  IT.  is 

(o. 

2/5,  2/5, 

0,  3/5,  3/5)  and  the 

minimum  is  1640  which 

is  lees  than  1900 

,  the  minimum  with 

the  added  con- 

8traint 

Of 

*2*  y3 

< 

as  known 

from  Federowic*  ’o 

problem).  Now 

we  must 

add  the  new  constraint 

C(T)  - 

z 

n 

+  t.  With 

this 

added  column,  we 

Bee  that  we  repeat  the  same  final  column  as  before,  only  -t  will  be  in  the 
last  row  (first  element).  Vo  must  now  pivot  on  this  column  with  each  of 
the  entries.  After  doing  this  we  see  that  all  the  entries  in  the  last 
row  are  positive  for  t  is  sufficiently  small  in  all  cases,  except  one 


where  we  pivot  on  the  y^  row.  Since  the  new  entries  in  the  c ^  column 

Q  & 

are  equal  to  c ,  ♦  :1.;j  where  a  ,  ,  is  the  pivot  element  of  the 

J  Q  1  I  *  J 


newly  added  column.  Thus  the  t  for  this  pivot  will  be  t  "min  J  i  J 

max  max  w  * 


where  a±'  . 
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Thus  wo  have: 


basic  change  from  y^,  y 6,  y^,  y2,  y$,  x 6,  x^,  yg,  R 


No. 


1 

*2  *  >  R 

— 

260 

x6 

2 

x.  -  >  R 

— 

-52II 

x> 

3 

•ij  -  >  R 

— 

600 

-  x6 

44 

x.  -  >  R 

4 

- — 

420 

*1 

5 

y-,  -  >  R 

— 

300 

''6 

Now  by  pivoting  on  the  column  we  do  not  change  the  objective 

(i.e.,  it  is  another  min  point)  function,  and  than  pivoting  in  the  R 
column  and  we  now  may  add  to  the  table - 


y .  -  >  x^,  '  A  -  >  R 
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-  y6  and  y3 

Thus  L  will  be,  in  order  of  the  numer  of  basis,  1,  5»  4,  2,  3-  Thus 
thf  first  one  locked  at  is  1.  Pivoting  we  have  with  the  new  column: 

(3ee  following  page). 


Mow  we  can  see  that  at  t  »  t  *=  260  we  have  0  or  1  as  the  first 

max 

three  elements,  and  thus  (0,  0,  1,  0,  0,  3/2)  is  the  minimizing  point. 

Since  there  were  no  elements  in  the  list  with  a  t  less  than  260, 

max 

this  is  the  solution  to  the  problem,  and  the  minimum  is  1900,  the  same 
as  obtained  by  Federowicz. 
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VJ.  A  COMPARISON  OF  FEDER(X\rICZ:£  METHOD  AND  SZWARC  'S  METHOD  FOR  SOLVING 
THE  MIXED  INTEGER  LINEAR  PROGRAMMING  PROBLEM  WHERE  THE  INTEGERS  ARE 
'ZERO  OR  ONE 

Now,  having  seen  both  methods  end  howl  they  have  been  applied  to  one 
specific  example,  we  can  say  something  about  the  relative  merits  of  the 
two  methods.  First,  we  shall  talk  about  the  problem  proposed  and  solved 
by  Saw  arc,  and  compare  it  to  the  same  problem  solved  by  Fedsrowlcz's 
generalized  algorithm  .  As  has  been  said  before,  the  attack  on  the  prob¬ 
lem  by  both  algorithms  are  roughly  the  same  in  that  they  both  use  the 
simplex  method  as  a  sub-algorithm  and  pivot  from  comer  point  to  comer 
point  in  searching  for  the  final  maximum.  The  difference,  however,  is 
that  Szwarc  defines  a  different  and  larger  problem  (problem  II)  which 

•l 

"contains"  the  original  problem  if  further  restrictions  are  made  (namely 
that  the  first  p  of  the  variable  be  O  or  l),  and  then  searches  down 
through  this  new  convex  set  of  feasible  solutions  until  a  solution  to  the 
original  problem  (problem  I)  is  found.  This  is  the  effect  of  the  parametric 
t  and  tha  added  constraint  in  problem  III.  Fedorowlcz's  algorithm  ,  how¬ 
ever,  does  quite  the  opposite.  He  does  not  change  the  original  problem  and 
alwayB  pivots  and  remains  in  basic  feasible  solutions  to  the  original  prob¬ 
lem  (problem  I  in  Szwarc 'e  algorithm).  He  starts  at  the  origin,  let 

us  say,  and  then  continually  increases  the  best  maximum  yet  found.  He  con¬ 
tinues  this  way  until  the  estimates  on  the  rest  of  the  set  (not  necessarily 
convex)  indicate  that  the  rest  of  the  set  is  less  than  the  current  maximum. 
Thus  we  can  see  that  Federowicz's  algorithm  works  with  a  smaller  set  which 
is  not  convex  and  finds  his  way  up  to  the  maximum,  whereas  Szwarc  works 
with  a  larger  set,  a  convex  set,  and  he  works  his  way  down  to  find  ths 
solution. 

From  this  we  might  suspect  that  Federowicz's  method  would  almost  always 
be  quicker  (in  computational  time)  to  find  a  solution  than  Szwarc' s  method, 
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However,  there  are  two  reasons  why  this  is  not  necessarily  so.  First, 
although  the  total  number  of  extreme  points  is  increased,  this  does 
not  guarantee  that  there  will  be  a  larger  number  between  the  solution 
to  problems  I  and  the  solution  to  Problem  II  than  extreme  points  con¬ 
sidered  by  Federowlcz's  method.  Second,  not  only  must  the  extrme  points 
of  the  current  maximi  in  Federowlcz's  method  be  considered  in  computational 
time,  but  also  we  must  consider  computation  time  expended  in  applying  the 
simplex  algorithm  on  the  groups  of  subsets  used  to  estimate  the  maximum  on 

v 

the  subsets  not  actually  substracted.  Nevertheless,  we  can  say  something 

about  the  relative  computational  merits  of  the  two  algorithms. 

First  we  should  look  the  relative  computational  time  of  Federowlcz's 

generalized  algorithm  and  hi3  first  more  special  one.  On  the  same  problem 
* 

the  generalized  algorithm  will  be  slower  because  the  matrix  is  n  ♦  a  by  2n 

if  thei-e  are  n.  variables  and  m  constraints  in  the  original  problem. 

2 

Thus  we  see  that  the  tableau  we  are  to  work  with  is  increased  by  2  n  +  nun. 
Which  could  3low  the  algorithm  down  considerably  and  cause  evsn  more  prob- 

ft 

leas  with  storage  space.  This  also  shows  some  of  ths  difficulties  involved 
as  the  number  of  variables  increases.  Not  only  do  the  number  of  pivots  in¬ 
crease,  but  also  the  time  it  takes  to  make  each  pivot. 

method 

Now  we  ^ an  use  Federowicz's  firBt/as  a  standard  to  increase  in  some 
sense  the  other  algorithms.  We  have  seen  that  the  generalized  algorithm 
for  the  same  numbor  of  on-off  variables  and  the  original  problem  takes  the 
s’ame  number  of  pivots  as  the  standard,  but  takes  a  longer  time  to  make  them. 
In  the  application  of  Szwarc’s  algorithm  to  the  standard  we  see  that  the 
matrix  1b  the  same  size  as  the  generalized  algorithm  and  the  question  is 
how  operations  on  pivots  are  to  be  made  to  find  the  solution.  If  the  ex¬ 
ample  previously  exhibited  is  any  indication,  we  see  that  this  algorithm 
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ie  much  shorter  than  either  of  the  other  two.  However,  it  is  eafly  to  see 
that  there  exists  many  examples  where  it  takes  much  longer  to  find  the 


solutions  and  we  mubt  go  much  deeper  into  the  list  L. 

of 

Now  let  us  see  what  happens  when  we  vary  the  number^Boolean  variables 
relative  to  the  other  variables  in  Srwarc's  problem.  Let  us  say  that  the 
number  of  Boolean  variables  is  p  ard  the  number  of  non-Boolean  variables 
is  n  and  there  are  m  constraints.  Now  let  us  count  the  number  of  ex¬ 
treme  pnAwta  that  will  be  in  the  total  convex  set  considered  by  Szwar-i ’  s 
problem  II,  end  see  how  many  fewer  there  will  be  in  Federowicz’s  set  which 
ip  smaller  but  not  convex. 

Tiret,  lot  us  count  the  total  number  of  extreme  points  considered  in 

Federowicz*s  treatment  of  the  problem.  It  is  easy-  to  see  that  there  are 

n»n)  extreme^in^.  + 

2P  subsets  to  consider,  and  for  each  of  these  eufcsetsi (m  n)  ■  ^ • 

m  n!  mi 


the  number  of  combinations  of  picking  m  +  n  things  m  at  a  time]  since 
there  are  m  slack  variables,  regular  variables  5:  d  only  m 
variables  in  the  basis.  Thus  in  all  there  are  2*>(m  m  n)  extreme  points 
in  the  eet.  Next  let  us  consider  the  number  of  extreme  points  in  Piwarc's 
;t\?blem  II.  Cue  to  the  nature  of  the  first  p  constraints  both  the 
variable  and  ite  slack  variable  cannot  both  be  out  of  the  basis,  if  the 
variable  (or  its  slack  variable)  is  one  of  the  first  p.  Thus  we  have 
three  possibilities  for  each  of  these  first  p  variables.  Either  both 
it  and  its  slack  variable  are  in  the  basis,  or  just  the  regular  variable 
or  Just  the  alack  variable.  (If  x^  is  one  of  the  first  p  variables, 

1  -  x^  "  x^!  is  its  slack  variable.)  If  there  are  k  variables  that  have 
both  the  regular  variable  ar.d  its  slack  variable  in  the  basis,  ther*  are 
(  £  )  combinations  cf  the  variables  ouch  that  this  can  happen,  2  V  different 
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arrangements  of  the  other  p-k  Boolean  variables,  and  (*  *  £  )  arrangement  a 
of  the  other  non-Boolean  variables.  Thus ,  if  p  <  m,  we  have 


(P)/m+n\  2^k 

k  b  -  k 


extreme  points  and  if  p  >  m,  we  have 


extreme  points  or  more  generally 


mln(p,m) 


f  p  \  /rIn  n\  2p_k  extreoe  points.  And  thus  we  see  that 
k-5  ^k^^m-K 


mih(p,m) 


e  )(■* n) 

'  k'  ra  -  k  ' 


is  the  number  of  additional  extreme  points 


created  by  Szwarc '  s  problem  t'C.  Let  us  now  obtain  a  lower  bound  for  this 
number.  First  let  us  suppose  that  p  is  small  relative  to  and  n 
(  p  <  m  at  least).  Then  (P  )£m  +  -?.p-k  >  min  [(“mn),(n-p)]^(k)  2p“lc 

min[(mton),  (m-p ) ]  (3P  -  2P),  {.(miun)  <  (m-p)  if  and  only  if  m  >  n  ♦  p  }  or 

(max  [m  n+p]^  &P  ”  2P/  »  Thus  if  m  is  not  too  much  larger  than  n  ♦  p 


we  see  that  for  large  p  (but  still  small  relative  to  m)  the  new  extreme 
points  of  Szwarc  'b  problem  far  exceeds  the  total  number  of  extreme  points  in 

4 

Federowicz  's  problem,  due  to  the  weight  of  the  3P-  In  fact,  if 

m  >  n  +  p,  (nmn )  (3P  -  2P)  >  2P  (mmn)  for  all  p  >  1  and  the  inequality 

* 

becomes  stronger  as  p  increases,  how  the  problem  is  to  estimate  the 
number  of  these  new  extreme  points  which  lie  above  the  set  considered  by 
Federowicz.  This  number  will  certainly  be  far  less  than  the  total  number 
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of  pivots  that  Szwarc'a  algorithm  finally  makes  since  each  of  these  points 

must  be  visited  at  least  once,  and  probably  more  often  since  the  algorithm 

will  have  to  retrace  its  steps  to  get  to  the  next  lowest  vertex.  Now,  if 

this  number  is  just  as  great  as  the  original  set  considered  by  Federowicz, 

i.e.,  2P  (mmn),  by  proportion  to  the  total  number  of  points  in  the  set, 

i.e.,  c  (3P  -  2P)  (mmn)  [for  some  constant  c]  we  see  that  for  sufficiently 

large  p  Federowicz's  method  will  be  much  superior,  since  at  the  very 

met  it  Considers  2P  (mmn)  extreme  points  and  usually  not  even  that  many. 

Now,  let  us  suppose  that  p  is  large  relative  to  m  or  n.  It  is  diffi  - 

cult  to  see  Just  what  will  happen  always.  However,  with  Szwarc’a  method  we 

can  see  that  the  additional  extreme  points  do  not  grow  as  3P  as  before 

since  in  the  sum  there  are  only  la  terms.  In  fact,  the  number  of  additional 

m  , 

extreme  points  is  *$•  ( p  )  / m  n  )  2P~  .  Vhereas  In  Federowicz ’s  they  are 
k?L  k  '  m-k ' 

still  2P  (m:in),  which  is  now  getting  quite  large.  In  fact,  let  us  say  that 
there  is  just  one  original  constraint,  ie.,  a  ■  1,  and  that  p^2n  ♦  2,  but 
still  large.  Then: 

rP  )  2p"k  -  P  2P  1/2  <  (n+1)  2p  -  2p  . 
k-1  Vk  Sn-k7  n  ' 

Now  we  see  that  since  (mnn)  ■  n  ♦  1  is  a  relatively  small  number  that  the 
"help"  Federowicz 's  method  gets  from  the  simplex  algorithm  will  be  relatively 
small  and  that  the  rules  for  determining  how  the  sets  are  broken  up  will  tell 

how  fast  the  algorithm  works.  If  the  example  is  any  indication  we  see  that 

% 

at  least  somewhere  on  the  order  of  2P  pivots  will  have  to  be  made.  In 
S 'ware  ’  s  case  we  see  that  no  more  than  p  sP  1/2  different  extreme  pointe  will 
be  visited,  and  there  will  probably  be  much  less.  Thus  if  too  many  are  not 
revisited  we  see  that  Szwarc'a  method  should  be  superior. 
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This  brings  up  an  important  point.  Much  depends  on  Just  how  "good" 

Szwarc ' o  method  is.  We  have  made  an  estimate  of  the  number  of  different 
extreme  points  to  be  visited,  but  obviously  much  depends  on  how  good  this 
estimate  is  and  how  often  we  revisit  an  old  vertex.  If  It  turns  out  that 
many  vertises  are  often  revisited,  we  may  be  able  to  remedy  the  situation 
by  changing  the  algorithm.  It  would  be  to  remember  instead  the  tableau 
at  any  vertex  it  might  have  occasion  to  return  to,  and  then  Jump  directly 
from  vertex  to  vertex  without  having  to  pivot  over  and  over.  Each  time  a 
pivot  -is  made  it  would  be  to  a  new  vertex.  Or  the  algorithm  could  be 
changed,  differently.  It  would  be  a  mixture  of  both  waye.  That  is,  only  a 
few  "key"  vertices  would  have  their  tableaus  remembered.  If  we  are  in  one 
oVid  of  the  set  and  we  wish  to  go  to  the  other,  we  first  Jump  to  the  tableau 
of  the  nearest  "key"  vertex  (nearest  in  the  sense  of  the  number  of  pivots) 
and  then  pivot  to  the  desired  vertex..  Vie  now  see  that  the  revised  algorithm 
v^lII  more  nearly  resemble  Federowicz'a  algorithm  due  to  the  storage  of  the 
tableaus,  and  in  my  opinion  present  a  fairer  comparison  of  the  two  algorithms. 

Now  one  more  interesting  special  case  is  when  n  ■  0,  that  is,  all  the 
variables  are  Boolean.  In  Fedarovd.cz ' s  algorithm  the  number  of  extreme 

\  p 

points  ie  just  2  .  However,  there  is  no  application  of  the  simplex 
algorithm  and  the  problem  is  Just  to  find  a  feasible  solution.  Once  again 
the  algorithm's  speed  depends  only  on  the  choice  of  the  rules  for  set 
subtraction.  In  Szwarc ' a  algorithm,  however,  the  simplex  algorithm  ie 
applied  and  we  may  hope  for  a  shorter  solution.  The  one  drawback  is  that 
the  set  of  extra  points  is  much  larger  than  2P.  To  see  this  suppose  m.  <  p, 
then:  ^  (v  \(  m  \  ?p~k  >  p  /m)  .?p~k-  2P  (p  2~®  -  1)  >  2P  if  m  is 

v  k-l^k' V-k'  k-1  ^ 

small  enough,  i.e.,  m  <  p  -  1.  However,  we  would  suspect  that  if  Szwarc 'b 

method  works  at  all,  it  should  work  in  this  case,  and  should  compare  favorably 
with  Federcwicz's  method. 


Now,  if  it  turns  out  that  both  algorithms  do  fairly  well  with  the  same 
problem,  and  they  both  take  about  the  same  amount  of  time,  we  might  be  able 
to  obtain  a  faster  algorithm  by  mixing  the  two,  and  use  each  one  to  help 
the  other.  The  way  it  would  work  is  as  follows.  At  each  step  or  pivot  we 
would  make  one  pivot  in  the  Szwarc  algorithm,  and  then  one  pivot  in  the 
Federowicz  algorithm.  If  in  Szwarc's  method  we  come  across  a  possible 
solution  to  problem  I  (  but  hat'  not  pivoted  to  it  yet  ),  that  is  it  has  a 
maximum  better  than  any  of  the  maximi  already  extracted,  and  in  the  group 
that  is  to  be  next  broken  up  we  may  extract  this  set  instead  of  those  de¬ 
termined  by  the  normal  rules.  Also,  if  in  Szwarc's  method  there  are  members 

f 

of  the  list  L  that  have  a  maximum  lees  than  the  current  value  of  the  best 
maximum  in  Federowicz's  method,  we  can  eliminate  it  from  the  list  L,  since 
it  will  never  be  pivoted  to  anyway.  Also,  if  we  decide  to  atop  before  the 
final  solution  is  found,  the  current  value  of  z  -  t  in  Szwarc's  method 
or  the  greatest  upper  bound  in  Federowicz “s  method  will  serve  as  an  upper 
bound,  and  the  best  point  found  by  Szwarc's  method  that  is  f easible  in  prob¬ 
lem  I  or  the  current  value  of  Federowicz 's  method  will  serve  as  a  lower 
bound  on  the  final  solution.  These  can  be  used  to  estimate  the  solution  if 
it  is  computationally  infeasible  to  go  on,  and  it  can  be  seen  that  the 
appropriate  bounds  also  apply  to  each  separate  algorithm  as  well  as  the 
composite  one  Just  described.  Also,  instead  of  applying  them  both  at  once 
one  might  be  inclined  to  try  first  one  then  the  other  if  the  first  proves 
infeasible  computationally. 

Laetly,  it  is  easy  to  see  that  both  methods  can  be  used  to  find  secondary 
maximi.  That  is  the  second,  third,  etc*  ,  best  value  that  the  function  can 
take  on  and  still  satisfy  the  constraints.  In  Federowi.-z's  algorith,  we 


just  ignore  the  fact  that  the  two  sequences  have  met  and  throw  out  the 

d 

point  where  they  have  crossed  and  continue  on.  In  Szvarc's  method  »11 
we  need  to  do  is  forget  that  the  Boolean  constraints  were  satisfied  and 
continue  on  listing  those  that  do.  It  can  be  seen  that  not  only  will  all 
the  feasible  points  of  Federowicz's  problem  (problem  I)  be  listed,  but 
also  all  the  feasible  points  of  problem  II  as  well. 
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